CDS
Accession Number | TCMCG075C04896 |
gbkey | CDS |
Protein Id | XP_017972092.1 |
Location | complement(join(1778590..1778691,1778815..1779093,1779290..1779568,1779712..1779930,1780317..1780485,1780773..1780947,1781028..1781139,1781249..1781476)) |
Gene | LOC18607299 |
GeneID | 18607299 |
Organism | Theobroma cacao |
Protein
Length | 520aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018116603.1 |
Definition | PREDICTED: squalene monooxygenase [Theobroma cacao] |
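The Location and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Location value follows the usual GenBank-style `complement(join(start..end,...))` syntax with 1-based inclusive coordinates. The function names `parse_location` and `spliced_length` are illustrative only and are not part of any database API. The sketch does not handle partial-feature markers (`<`, `>`) or single-base positions.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(1778590..1778691,1778815..1779093,1779290..1779568,"
    "1779712..1779930,1780317..1780485,1780773..1780947,"
    "1781028..1781139,1781249..1781476))"
)


def parse_location(location):
    """Return (strand, segments) for a simple GenBank-style location string.

    strand is "-" when the location is wrapped in complement(...), else "+".
    segments is a list of (start, end) tuples, 1-based and inclusive.
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Total number of nucleotides covered by all segments."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
cds_nt = spliced_length(segments)

# A complete CDS ends with a stop codon, so the protein has one residue fewer
# than the number of codons.
protein_aa = cds_nt // 3 - 1

print(strand, len(segments), cds_nt, protein_aa)  # - 8 1563 520
```

The eight segments sum to 1563 nt, i.e. 521 codons including the stop codon, which agrees with the reported protein length of 520aa.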
EGGNOG-MAPPER Annotation
COG_category | CH |
Description | squalene monooxygenase activity |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | R02874 |
KEGG_rclass | RC00201 |
BRITE | ko00000, ko00001, ko01000 |
KEGG_ko | ko:K00511 |
EC | 1.14.14.17 |
KEGG_Pathway | ko00100, ko00909, ko01100, ko01110, ko01130, map00100, map00909, map01100, map01110, map01130 |
GOs | - |
Sequence
CDS: ATGGCTTACCAGTACATAGTTGGAGGAGTGATAGCTTCTCTTCTGGGGTTTGTTTTCTGGTACAATTCTTTGGTAAGAGAACTCAAGAAGACAAGAACATCATCTATGGAGTTCCCAGTGGAAAACCGTGTGAAGAAAACCGGAAACGGCGAGGTTGCTGGGGCTATAGATGTTATCATAGTCGGTGCCGGAGTTGCAGGTGCTGCTCTTGCCTATACTCTTGGAAAGGACGGACGTCGAGTGCATGTGATCGAGAGAGAGTTAAACGAACCTGACAGAATTGCTGGTGAAGGTCTATTACCAGGAGGCTACGTCAAGTTAACTGAGTTAGGCCTGGAGGATTGTGTAGCTGGGATTGATGCTCAGCGAATTTTGGGTTATGATCTGTACAAGGATGGAAAGAGTACCAAGATATCTTTTCCCCTGGAAAAATTTCAGTCCCATGTGGCTGGAAGAACCTTCCATAATGGACGCTTTGTACAAAAGTTGCGGGAGAAAGCTGCATCTCTTCCCAATGTAAATCTAGAACAAGGGACAGTAACATCTCTGCTTGAAGAAAATGGGACTATTCTAGGAGTGCATTACAAAAACAAGAGTGGTCAAGAGCTGACAGCATCGGCTCCCCTCACCATTGTCTGCGATGGTGGATTCTCAAATTTGAGACGCTCTCTCTGCTACCGTAAGGTTGATATCCCCTCTTATTTTGTTGGTTTGGTTCTGGAGAACTGTAAACTGCCGCATGCAAATTATGGAGCTATTATACTGGAAGATCCTTCACCTATCTTGTTTTATCCTATTAGCAGCACTGAAATTCGTTGCTTGGTTGATGTACCTAGCCAAAAACTACCTTCTGTTTCAGGTGGTGAAATGGCCCATTTCTTAAAAACTGTGATAGCTCCCAAGATTCCTCCTGAACTATACACTGCCTTTATCTCTGCAGTAGAGAAGCAGAACAACATAAGAACTATGGCGAATAGAACCATGCCAGCAGCTCCACTCCCTACTCCTGGTGCACTTTTGATGGGTGATGCATTCAATATGCGACATCCTATAACCGGAGGAGGAATGACTGTTGCACTATCTGATGTTGTTGTGATAAGGGATCTTCTAAGACCCTTGCACAATCTAGGTAATGCATCGGCAGTTTGCAGATATCTTGAATCTTTTTATACCCTGAGGAAGCCAATGGCATCTACGATAAATACGTTGGCTGACACCCTACACAAGGTATTTAGTGCCTCGTCTGATCCTGCAATGGAGCAAATGCAACAAGCATGTTTCGGCTATTTGAGTCTTGGAGGCATATTTTCAAATGGACTATCATCTCTACTCTCTGGTCTGTACCCTCGTCCATCAAGCTTAGCATTTCACTTCTTTGCCATGGCAGTGTATGGTGTTGGCCGGTTGTTACTTCCATTTCCTTCTCCCAACCGCATTTGGACCGGGGCTAAACTGATTTGGGTTGCATCAGGTATCCTTTTCCCCCTTATAAAGTCTGAAGGAGTCAGACAAATGTTTTTCCCTCTAACTGTGCCAGCATACTACAGAGCTCCTCCCCTCTAA |
Protein: MAYQYIVGGVIASLLGFVFWYNSLVRELKKTRTSSMEFPVENRVKKTGNGEVAGAIDVIIVGAGVAGAALAYTLGKDGRRVHVIERELNEPDRIAGEGLLPGGYVKLTELGLEDCVAGIDAQRILGYDLYKDGKSTKISFPLEKFQSHVAGRTFHNGRFVQKLREKAASLPNVNLEQGTVTSLLEENGTILGVHYKNKSGQELTASAPLTIVCDGGFSNLRRSLCYRKVDIPSYFVGLVLENCKLPHANYGAIILEDPSPILFYPISSTEIRCLVDVPSQKLPSVSGGEMAHFLKTVIAPKIPPELYTAFISAVEKQNNIRTMANRTMPAAPLPTPGALLMGDAFNMRHPITGGGMTVALSDVVVIRDLLRPLHNLGNASAVCRYLESFYTLRKPMASTINTLADTLHKVFSASSDPAMEQMQQACFGYLSLGGIFSNGLSSLLSGLYPRPSSLAFHFFAMAVYGVGRLLLPFPSPNRIWTGAKLIWVASGILFPLIKSEGVRQMFFPLTVPAYYRAPPL |
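The relationship between the CDS and Protein strings above can be illustrated with a small translation routine. This is a sketch only, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper is hypothetical and written for illustration. It is not how the record's protein sequence was produced.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0.

    Unknown codons (e.g. containing N) become "X". When to_stop is True,
    translation ends at the first stop codon, which is not emitted.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCTTACCAGTACATAGTTGGAGGAGTG"))  # MAYQYIVGGV
print(translate("GCTCCTCCCCTCTAA"))                 # APPL
```

The two fragments reproduce the first ten residues (MAYQYIVGGV) and the last four residues (APPL) of the Protein string. Passing the full CDS string to the same function should allow the whole translation to be compared against the record.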